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Aerobic anoxygenic phototrophic bacteria (AAPB) are important functional groups and are widely distributed in the global up- 
per ocean. Here we report the draft genomic sequences of two marine AAPB isolates belonging to the genus Erythrobacter, Eryth- 
robacter longus strain DSM 6997 and Erythrobacter litoralis strain DSM 8509. 
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Aerobic anoxygenic phototrophic bacteria (AAPB) play signif- 
icant roles in the carbon cycle in the oceans (1). They are 
obligate aerobes and perform photoheterotrophic metabolism 
(2). AAPB contain bacteriochlorophyll a (BChl a), are widely dis- 
tributed in the class Proteobacteria, and are closely related to pur- 
ple photosynthetic bacteria (3-5). The first isolate of marine 
AAPB was Erythrobacter longus DSM 6997 (also named OCh 101) 
(6). Later, more species of AAPB were isolated from diverse ma- 
rine environments (3, 7). Recently, the genome of another type 
strain, Erythrobacter sp. strain NAP1, has been sequenced, which 
does not contain genes involved in CO z or N 2 fixation that con- 
firmed the photohetero trophy of the AAPB (8). 

Erythrobacter longus strain DSM 6997, isolated from the sur- 
face of green seaweed Enteroporpha linza, is a type strain of the 
Erythrobacter genus (3). The strain DSM 6997 is characterized by 
Gram-negative and orange-pigmented rods (6) and has the ability 
to store polyhydroxyalkanoate (PHA) as a carbon source in the 
cell (9). Moreover, another species, Erythrobacter litoralis strain 
DSM 8509, was isolated from a cyanobacterial mat sample (10). 
The strains DSM 6997 and DSM 8509 are both slightly halophilic 
and multiply by binary division (10). Here we report the draft 
genomes of these two type strains of the Erythrobacter genus. 

The draft genomes of these two strains were obtained by Illu- 
mina mate-paired sequencing technology. Mate-paired reads of 
average 100-bp length were assembled using Velvet software 
(vl.2.03) (11). Total contig sizes of -3.55 Mbp with an average of 
680X coverage of strain DSM 6997 and -3.17 Mbp with an aver- 
age of 780 X coverage of strain DSM 8509 were obtained. The 
overall G+C contents of strains DSM 6997 and DSM 8509 are 
57.37% and 65.15%, respectively. The open reading frames 
(ORFs) were identified using GLIMMER (12). Gene prediction 
was performed using BLASTn against the non-redundancy (nr) 
nucleotide database to identify the orthologous sequences. A total 
of 3,078 protein-coding genes were obtained in the genome of 
strain DSM 6997 and 2,793 genes were obtained in strain DSM 
8509. tRNAscan-SE was employed to identify the tRNA sequences 
(13), followed by rRNA identification performed using RNAm- 



mer software (14). There are 3 rRNAs and 42 tRNAs and 4 rRNAs 
and 44 tRNAs in the genomes of strains DSM 6997 and DSM 8509, 
respectively. 

Gene annotation was performed using BLASTn against the nr 
database and KEGG protein database (15). There are 2,023 pro- 
teins and 2,118 proteins that annotated to clear functions in the 
genomes of strains DSM 6997 and DSM 8509, respectively. The 
functions of the genes are defined by association with clusters of 
orthologous groups (COG) classification against the conserved 
domains database (CDD) (16) and the KEGG pathway collection 
( 17). There are 2,281 proteins that are classified to COG categories 
in the genome of strain DSM 8509 and 2,329 proteins are assigned 
to different COG categories in the DSM 6997 genome. Relatively 
high similarities of DSM 6997 and DSM 8509 to Erythrobacter sp. 
NAP1 are observed. 

Nucleotide sequence accession numbers. These whole- 
genome shotgun projects have been deposited at DDBI/EMBL/ 
GenBank under the accession no. JMIW00000000 and 
JMIX00000000 for strain DSM 6997 and strain DSM 8509, respec- 
tively. The versions described in this paper are versions 
JMIW0 1000000 and JMIX0 1000000. 
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